In preparation for submission to Solar Physics 
http: / /www. springerlink.com/content/0038- 0938 



Automatic Detection of Limb Prominences in 304 A 
EUV Images 

N. Labrosse 1 • S. Dalla 2 • S. Marshall 3 

Received: ••••••••••• / Accepted: ••••••••••• / Published online: •••••••••• 



Abstract A new algorithm for automatic detection of prominences on the so- 
lar limb in 304 A EUV images is presented, and results of its application to 
SOHO/EIT data discussed. The detection is based on the method of moments 
combined with a classifier analysis aimed at discriminating between limb promi- 
nences, active regions, and the quiet corona. This classifier analysis is based on a 
Support Vector Machine (SVM). Using a set of 12 moments of the radial intensity 
profiles, the algorithm performs well in discriminating between the above three 
categories of limb structures, with a misclassification rate of 7%. Pixels detected 
as belonging to a prominence are then used as starting point to reconstruct the 
whole prominence by morphological image processing techniques, ft is planned 
that a catalogue of limb prominences identified in SOHO and STEREO data 
using this method will be made publicly available to the scientific community. 
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1. Introduction 

Solar prominences are large structures confining a cool and dense plasma in 
the hot, tenuous solar corona. Once they are formed, they may remain rela- 
tively stable for long periods, over a solar rotation or more. The exact pro- 
cesses taking place during formation, evolution, and disappearance are, however, 
not well understood. Most studies dedicated to solar prominence physics so 
far have dealt with individual examples of well-observed structures, and there 
have been very few attempts to make large statistical studies of their proper- 
ties ( |Gilbert et at, 200fj| |Mackay,"G aizaus kas, and Yeates, 2008[ for example). 
There are still large uncertainties over prominence physical parameters, and it 
is not clear how some plasma properties may vary from one prominence to one 
other (see the review by [Patsourakos a nd Viall I2002|) . 
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Understanding the cause of the disappearance of prominences is essential 
in many respects. Prominences usually disappear during an eruption where 
a great amount of matter and energy is released within a few hours. These 
dynamic events are sometimes linked to coronal mass ejections (CMEs) and/or 
flares. CMEs and flares are a major source of disturbances in the interplanetary 
medium, which is of course critical for human activities in space, including 
around the Earth. Erupting filaments (seen on the disk from the Earth) can be at 
the origin of Earth-directed CMEs and flares. One major issue faces those trying 
to understand prominence properties and to unveil the mechanisms leading to 
the eruptions: the search for a large number of relevant observations. At present, 
no catalogue of off-limb solar prominences is available to scientists: our primary 
objective is to generate such a catalogue and to make it available within the 
Virtual Observatory. In order to create a catalogue of solar prominences and 
facilitate this task, we adopt the following principles: 

• We use data from spacecraft that perform near-continuous observations of 
the Sun in order to make a list of as many events as possible. We want to 
use full-Sun images in order to catch all events at any one time. The best 
instruments corresponding to this description are so far the Extreme Ultra- 
violet Imager (EIT) on the Solar and Heliospheric Observatory (SOHO), 
and the EUVI on STEREO/SECCHI. These datasets are publicly available. 
The AIA imaging suite on SDO will also match these criteria once launched 
and in operation. 

• Prominences are best observed from space in the resonance line of He n 
at 304 A. This line is (or will be) observed by the three instruments 
mentioned above (EIT, EUVI, AIA). EIT and EUVI also have other chan- 
nels corresponding to a higher temperature emitting plasma (typically the 
coronal plasma). In these channels, prominences are usually seen as dark 
features, because they absorb the coronal radiation from behind them, and 
because the cool prominence plasma is not hot enough to emit in these 



lines ( Anzer and Heinzel, 2005 Heinzel et al., 2008). AIA on SDO will also 



have additional channels where prominences and filaments are expected to 
be seen in absorption. 

In this work we will be looking at off-limb structures only. The detection of 
structures on the disk is deferred to a later study. The automatic detection 
of solar prominences and the ensuing construction of a solar prominence cat- 
alogue will provide researchers with interests in understanding the formation, 
stability, and disappearance of prominences and filaments with a new tool. It 
will also be valuable for studying the links between the different manifesta- 
tions of large-scale eruptions on the Sun: eruptive prominences and filaments, 
CMEs, flares. It will be possible to link the prominence catalogue with filament 
dZharkova et al, 2005] ) and flare QAschwanden et al., 2009[ ) and CME ( |Gopalswamy et al., 2009[ | 



Robbrccht, Bcrghmans, and Van der Linden, 2009) catalogues. Having access to 



a large statistical sample of these events will make it easier to check theories 
against observations. In addition to solar prominences, our algorithm identifies 
active regions on the limb, and this information will also be compiled into a 
catalogue. 
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In recent years, there has been considerable interest in developing methods for 
automated detection of prominences. The majority of efforts focussed on the de- 
tection of prominences on the solar disk, i.e. filaments. |Bernasconi, Rust, and Hakim|| 
(|2005p used a method based on thresholds and morphological filtering to detect 
filaments in Ha images. The EGSO Solar Feature Catalogue included filament 
information, obtained from Ha data by means of image enhancement and thresh- 
olding followed by a region-growing procedure ( |Fuller, Aboudarham, and Bentley, 2005[ ).[ 
Issues related to the tracking of filaments over consecutive solar images have 
been discussed bv I Abo udarha m" et al.\ (|2008|) . Detection of filaments using EUV 
data has been achieved using the four wavelengths of SOHO /EIT complemented 
by magnetogram data, by means of an image segmentation technique using a 



combination of region-based and edge-based methods (Scholl and Habbal, 2008). 

As far as prominences on the limb are concerned. IFoullon and Verwichtel 
piKJrJl) considered SOHO/EIT images at 171, 195, 284, and 304 A, and applied 
multiple techniques, including bright histogram segmentation and edge detection 
to automatically identify prominences, making use of the fact that active regions 
contribute to emission in all four wavelengths, while prominences appear bright 
only in the 304 A images. Other studies focussed on the detection of limb 
prominence activity (e.g. eruptions) using data from the Nobeyama Radiohe- 



liograph (Shimojo et at, 2006) and Ha images from Big Bear Solar Observatory 
QFu, Shih, and Wang, 2007j 

The technique of IFoullon and Verwichtel (|2006[) suffers from the necessity of 
having a complete set of images taken at various wavelengths in order to find 
the off-limb prominences. Here we are interested in limiting the detection of off- 
limb structures to one wavelength only. The main argument for this is that this 
method will be more widely applicable to a range of instruments as long as they 
observe in this particular wavelength (here 304 A) . The success will not depend 
on the availability of other types of observations. There are also issues regarding 
processing speed, which are crucial if we want to apply this technique for on the 
fly prominence detection on high-cadence SDO images. 

This paper presents a new method for detection of limb prominences, based 
on moments of radial intensity profiles in 304 A images. The method has so 
far been applied to SOHO/EIT images, and its extension to STEREO/EUVI 
and SDO/AIA data is envisaged in the near future. In Section [5] we describe 
the intensity profiles that are extracted from pre-processed EIT images and the 
moments of the profiles that are used for the classification of off-limb structures. 
In Section [3] we describe the training data set which is used to train a Sup- 
port Vector Machine, and the results from the classification. Section [4] presents 
the application of this classification algorithm to the automatic detection of 
prominences and Section [5] discusses the reconstruction of these features by 
morphological image processing techniques. Finally, we discuss our results in 
Section [6l 



2. Radial Intensity Profiles 

EUV images at 304 A reveal prominences as structures of enhanced brightness 
compared to the surrounding quiet corona. The automatic detection of promi- 
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Figure 1. Radial profiles and intensity histogram for a prominence. 



nences in such an image is however greatly complicated by the frequent presence 
in the same image of active regions (ARs), where emission is also enhanced 
compared to quiet Sun areas. There are some differences in the morphology 
and appearance of prominences and ARs: the latter are typically brighter and 
more compact. In addition, the temperature at which ARs emit is different than 
for prominences, and therefore the response of the instrument should also be 
different for these structures. 

Our detection method aims to characterise differences between prominences 
and other types of regions (either ARs or quiet corona) by analysing radial in- 
tensity profiles and calculating their moments. Although radial intensity profiles 
for prominences and ARs may appear similar to the eye, actual extraction of 
these profiles shows that they are not, and their moments allow us to capture 
their differences. 

To extract the radial profiles, we calibrate the SOHO/EIT images using the 
standard eit_prep procedure and transform to polar coordinates (r, 9) where r 
indicates the radial coordinate (distance from the centre of the solar disk as 
specified in the image's FITS headers) and 9 the angular coordinate. We then 
obtain a radial cut at a given angular location 9q by extracting all points with 
angular coordinate within ±1° of 9q and with radial coordinate r satisfying 
1.01 < r < 1.35 R Q . 

2.1. Example Radial Intensity Profiles 

A typical radial intensity profile for a prominence is shown in Figure [TJ The 
left panel of the figure shows the intensity profile normalised to the sum of all 
intensity values in the profile. A double power law is fitted to the data points 
(solid line). The right panel of Figure Q] shows the intensity histogram for all 
points in the radial profile. 

Figure [2] shows radial intensity profiles and the intensity histogram for an AR, 
with the same format as in Figure [1] Comparing Figures Q] and [21 one notices 
that the prominence profile displays a rather sharp boundary at about 1.08 Rq, 
while emission from the AR gives a smoother curve. The comparison of the 
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Figure 2. Radial profiles and intensity histogram for an active region. 




Figure 3. Radial profiles and intensity histogram for a quiet corona region. 



intensity histograms shows that, as one would expect, AR emission is typically 
characterised by higher intensities. 

Figure [3] shows the radial intensity profile for the quiet corona and the corre- 
sponding intensity histogram, characterised by much lower intensities than for 
the prominence and AR examples, as well as a different variation with r. 

2.2. Moments 

Moments represent a powerful way to capture the features of the radial intensity 
profiles and intensity histograms described in Section \2. II We experimented with 
a variety of possible definitions of moments, and identified two sets that allow 
us to best discriminate between different limb structures. 

A first set of moments is obtained from the intensity histograms of the profiles 
as follows: we define as yk the starting location of the fc-th intensity bin in the 
histogram and as nk the number of pixels in the radial profile with intensity 
falling in the k-th bin. The following set of six moments is then introduced: 

^ = 4E(w*-°) ron * ' rn = 0,...,5 (1) 

k 
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with 

a= E^± (2) 

where a is the average intensity and N the total number of pixels in the profile. 

The second set of moments is calculated from the radial intensity profiles by 
means of the following procedure: we indicate as rj the radial position of i-th 
data point in the profile, as yi its intensity and we define normalised intensities 
(as displayed in the left panels of Figures [TH3]) as 

ym = ■ (3) 

We fit each normalised intensity profile with a double power law (solid line in 
Figures [T]-[3]) defined by: 

yf lt (r)=aor ai +a 2 r a3 . (4) 

We can then obtain a new variable z% = \yNi~ Ufit[fi)\ and calculate its moments, 
defined by: 



^ N 



ith 



lj2( r i-b) m Zi , m = 0,...,5 (5) 



E* «• 

This provides a second set of six moments. 



(6) 



3. Classification 

Our aim is to obtain a decision rule that will allow us to classify a given ra- 
dial profile as belonging to one of three classes, either "Prominence" , "AR" , or 
"Other" , depending on the values of its moments. Once a given profile has been 
identified as of prominence type, then it will be possible to retrieve the entire 
prominence by means of a region-growing technique. 

3.1. Training Data 

To obtain the required decision rule, we first assemble a datasct of EIT im- 
ages that will be used as training data for the algorithm. We select a set of 
25 SOHO/EIT 304 A images consisting of approximately one image every six 
months between 1996 and 2006, covering a full solar cycle. 

We extract ten radial profiles per image, for a total of 250 radial profiles, that 
we classify by eye as either "Prominence" , "AR" , or "Other" . We also use visual 
inspection of SOHO/EIT images at 171 A and 195 A to distinguish more clearly 
between cool and hot structures. In principle, prominences will appear bright 
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in the 304 A channel, and will be dark or invisible in the other two channels. 
Active regions will appear bright in all three EUV channels. 

We randomly choose 167 profiles as the training set and feed the other 83 to 
our classification algorithm as unlabelled profiles in order to test its performance. 



3.2. Support Vector Machine (SVM) 



Simple linear classifiers plot data samples in a multidimensional feature space 
and seek to classify them by positioning a linear hyper-plane such that it parti- 
tions them into two different groups. In practice there can be many hyper-planes 
that partition the data and some of these may require only a small perturbation 
in some of the data samples to lead to misclassification. 

Support Vector Machines, or SVMs (Cristianini and Shawe- Taylor, 2000), are 
more sophisticated in that they partition the data using two parallel hypcr-plancs 
and adjust these so that the planes are the maximum distance apart without mis- 
classifying any of the data samples. In practice it may be that it is not possible to 
separate a given set of data samples with a linear plane. In these cases a number 
of extensions exist which allow the SVM to deal with this situation. One approach 
is the so called soft classifier approach ( Cortes and Vapnik, 1995 ) which allows a 
limited number of samples to be misclassificd and seeks to minimise this number. 
Another approach is the so called kernel trick ( Aizerman, Braverman, and Rozonoer, 1964[ )[ 
that maps data into a higher-dimensional space in which it is linearly separable. 

SVMs represent a powerful technique for general classification, regression, and 
outlier detection with an intuitive model representation. For binary classification, 
we look for the optimal separating hypcrplane between two classes by maximizing 
the margin between the classes' closest points. The points lying on the boundaries 
are support vectors, and the middle of the margin is the optimal separating 
hyperplane. When we cannot find a linear separator, data points are projected 
into a higher-dimensional space where they effectively become linearly separable. 
The whole task can be formulated as a quadratic optimization problem which 
can be solved by known techniques. A program able to perform all these tasks 
is called a Support Vector Machine. 

In this work we use the libsvm C++ implementation of SVMs (Cha ng and Lin, 2001[ )[ 
through an interface using the R language (jhttp:/ /www. stats. bris.ac.uk/R/web/packages/el0 71/[ ).[ 
The kernel used in the algorithm for training and prediction is a radial basis func- 
tion, parametrized by one constant. Therefore we have two free parameters for 
training the SVM. The other parameter is a penalizing parameter for constraints 
violation (points lying between the margins). The choice of the parameters is 
crucial for obtaining good results, and we have therefore conducted trials over a 
large grid-space to find the best results. 



3.3. SVM results 



Table [T] displays the results of the SVM analysis using the first set of moments as 
defined in Equations. ([I])-©. The true number of prominences, as determined 
by our visual identification, is 39. Of these, 30 arc correctly identified by our 
algorithm. The true number of ARs is 21 and the algorithm correctly identifies 
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Table 1. SVM results for the first set of moments. 



AR (true) 


Other (true) 


Prominence (true) 


AR (predicted) 21 





4 


Other (predicted) 


22 


5 


Prominence (predicted) 


1 


30 


Table 2. SVM results for the two sets of moments. 


AR (true) 


Other (true) 


Prominence (true) 


AR (predicted) 19 








Other (predicted) 


21 


1 


Prominence (predicted) 2 


3 


34 



all of them. The remaining profiles, corresponding to the quiet corona, are 23 
in number, 22 of them are correctly classified by the SVM algorithm. The total 
number of misclassificd profiles is ten, giving a misclassification probability of 
12%. 

In order to improve the success of the algorithm, i.e. to decrease the mis- 
classification probability, we now use the second set of moments defined in 
Equations. (J5j) — ((6]) in combination with the previous one. The SVM algorithm 
is now trained with 12 moments describing each radial profile. Table [2] displays 
the results of the SVM analysis using the two sets of moments. Note that the 
83 profiles which are used for testing the algorithm are not the same as for 
Tabic [TJ since they are chosen randomly. This is the reason why the number of 
true profiles in each category is not the same as in Table [TJ The true number 
of prominences, as determined by our visual identification, is 35, of which 34 
are correctly identified by the algorithm. The true number of ARs is 21 and 
the algorithm correctly identifies 19 of them. 24 out of 27 remaining profiles 
corresponding to the quiet corona are correctly classified. The total number of 
misclassified profiles is now six, giving a misclassification probability of 7%. 

4. Detection of Limb Prominences 

The steps required to create a catalogue of solar prominences from the EIT 
database of observations at 304 A, are as follows: 

1. Given a new EUV image, we select a number of angles (up to 90 to have 
a sufficient coverage of the limb) at which we compute the radial intensity 
profiles and the associated moments. 

2. We use the now-trained SVM algorithm to automatically decide what type of 
structure is present. 

3. Where prominences are detected, we apply growing region techniques to 
recover the whole structure. 
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Figure 4. Full-Sun EIT image at 304 A on 31 May 1996 showing detected prominences (P) 
and active regions (AR). 

4. We calculate barycentre, area, and angular / radial extension of the whole 
structures found on the image. 

5. We proceed to a subsequent image. A comparison between subsequent images 
enables us to track any changes in time for a given prominence, such as 
disappearance due to the passage on the disk or behind the limb, eruption, 
or other types of disappearance. 

We illustrate the first two points above. We now consider all 25 images that 
were used to manually label the intensity profiles as described in Section 13.11 
For each image we construct 90 profiles equally spaced by four degrees. We then 
calculate the 12 moments of the intensity profiles as discussed in Section 13.31 
Two examples are shown in Figures 0] (close to solar minimum) and [5] (close to 
solar maximum). Figure [4] shows that all prominences are detected. In addition, 
an active region on the east limb is also detected. Prominences are also detected 
on the southeast limb. Figure [5] shows a full-Sun image closer to the maximum 
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Figure 5. Full-Sun EIT image at 304 A on 30 May 1999 showing detected prominences (P) 
and active regions (AR). 



of the solar-activity cycle. It is obvious that the background diffuse emission 
of the corona is brighter than earlier in the solar cycle (Figure [4J . Despite the 
lower contrast between prominences and the corona, all visible prominences are 
detected as such. All active region loop systems are detected as such. There are a 
couple of less clear cases of prominence or AR detection, that might be discarded 
at the stage of catalogue production because they do not meet the criteria for 
inclusion (i.e. they will not meet the minimum size requirement). 

It takes about 17 seconds to apply the standard SSW corrections on an 
EIT image, subsequently take 90 radial profiles, and classify them using the 
SVM algorithm. We expect to be able to reduce this time thanks to various 
optimisations in the codes. Also, we will aim to reduce the number of profiles 
(less than 90) while still keeping a good coverage of the solar limb. 

In the next Section we apply region-growing techniques to reconstruct the 
whole prominence once a detection is made. 
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Figure 6. Example of morphological opening by reconstruction for a prominence observed by 
SOHO EIT on May 26 2006. The left panel shows the subfield of the original image on which 
the reconstruction is run. The right panel shows the output. 

5. Prominence Reconstruction 

Once our detection procedure has identified that a prominence exists at a partic- 
ular angular location, we automatically reconstruct the entire prominence using 
morphological image processing. This allows us to obtain the pixels that belong 
to the prominence and extract its characteristics in the next step. 

We start by applying morphological opening to the image to remove spu- 
rious bright pixels (e.g. cosmic ray hits). To start the reconstruction, we use 
the information from the detection algorithm to identify a starting pixel (the 
marker) that belongs to prominence. Next we apply morphological opening by 
reconstruction using this marker and a circular structure element with radius of 
10 pixels. 

Figure [5] shows an example of prominence reconstruction obtained using our 
method. The pixels belonging to this complex prominence are identified. The size 
of the subimage shown is 277x306 pixels and it is likely that the three separate 
regions identified belong to the same prominence. 

6. Discussion and Conclusions 

In this paper we present the first attempt to automatically detect solar promi- 
nences above the limb using only one type of observation, namely EUV images 
at 304 A. Despite the fact that prominences and other coronal off-limb struc- 
tures such as active regions sometimes look similar on standard EIT images, the 
method of the moments of the intensity profiles, coupled with a Support Vector 
Machine for the classification of the moments in three categories, is working and 
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yields a small rate of misclassification. While our initial aim was to detect and 
construct a catalogue of limb prominences from the 304 A images produced by 
EIT - and also from STEREO/EUVI - it appears that this method is also 
effective in identifying active region loop systems above the limb. 

The biggest challenge in this exercise is to remove the effect of the background 
corona. The diffuse emission that we observe is a mixture of true emission 
(as we can observe that it is brighter around the solar maximum than at so- 
lar minimum) and of the instrument's scattered light. At present there is no 
automatic procedure to remove the latter component from the images, but 
observations performed during the Mercury transit of 15 November 1999 may 



help (Auchere and Artzner, 2004). Removing the instrumental component of the 
scattered light will increase the contrast between the background corona and the 
other structures. 

We plan to run our detection algorithm on the entire SOHO/EIT dataset of 
304 A images to generate a catalogue of limb prominences, to be made publicly 
available to the scientific community via a web page and via Virtual Observatory 
interfaces developed by the AstroGrid project. We will also be implementing our 
detection method on 304 A images from STEREO/EUVI and SDO/AIA for the 
generation of a limb prominence catalogue for these missions. The resulting 
catalogue can be used to perform a statistical study of, e.g., the large-scale 
properties of prominences as seen on the limb over several solar cycles (thanks 
to the continuity between EIT, EUVI, and AIA observations). The relationship 
between limb prominences and CMEs will also be investigated. 
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